Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 5986025 |
| Missing cells | 54248 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.8 GiB |
| Average record size in memory | 689.5 B |
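The missing-cell percentage above can be rechecked from the counts in the table. This is a minimal sketch that assumes the percentage is missing cells divided by all cells (variables × observations); the variable names are illustrative only.

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 19
n_observations = 5_986_025
missing_cells = 54_248

# Assumption: percentage = missing cells / (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
```

The result falls well below 0.1%, which is consistent with the "< 0.1%" shown in the table.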
Variable types
| Numeric | 9 |
|---|---|
| DateTime | 1 |
| Text | 5 |
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| ARREST_BORO is highly overall correlated with ARREST_PRECINCT and 1 other fields | High correlation |
| ARREST_PRECINCT is highly overall correlated with ARREST_BORO | High correlation |
| KY_CD is highly overall correlated with LAW_CAT_CD | High correlation |
| LAW_CAT_CD is highly overall correlated with KY_CD | High correlation |
| Latitude is highly overall correlated with Y_COORD_CD | High correlation |
| Longitude is highly overall correlated with X_COORD_CD | High correlation |
| X_COORD_CD is highly overall correlated with ARREST_BORO and 1 other fields | High correlation |
| Y_COORD_CD is highly overall correlated with Latitude | High correlation |
| LAW_CAT_CD is highly imbalanced (54.3%) | Imbalance |
| PERP_SEX is highly imbalanced (58.2%) | Imbalance |
| Y_COORD_CD is highly skewed (γ1 = 36.55155715) | Skewed |
| Latitude is highly skewed (γ1 = 33.47851636) | Skewed |
| Longitude is highly skewed (γ1 = 389.6236999) | Skewed |
| ARREST_KEY has unique values | Unique |
| JURISDICTION_CODE has 5028649 (84.0%) zeros | Zeros |
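The skewness alerts report a coefficient γ1. The sketch below shows the plain moment-based (population) form, γ1 = m3 / m2^1.5, on made-up data, to illustrate how a handful of far-off values inflates it. The estimator actually used by the profiling tool may apply a bias correction, so treat this as an assumption; the `skewness` helper and the toy values are hypothetical.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Toy data (hypothetical): evenly spaced latitudes, then two distant values added.
typical = [40.6 + 0.01 * i for i in range(30)]
with_outliers = typical + [62.0, 59.6]

print(skewness(typical))        # close to zero for symmetric data
print(skewness(with_outliers))  # strongly positive once the outliers are added
```

The same mechanism would explain the large γ1 for Y_COORD_CD, Latitude and Longitude, where a few records sit far from the bulk of the values.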
Reproduction
| Analysis started | 2025-10-15 18:39:32.069212 |
|---|---|
| Analysis finished | 2025-10-15 18:41:53.503256 |
| Duration | 2 minutes and 21.43 seconds |
| Software version | ydata-profiling v4.17.0 |
| Download configuration | config.json |
Variables
ARREST_KEY
Real number (ℝ)
Unique
| Distinct | 5986025 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2194727 × 10⁸ |
| Minimum | 9926901 |
| Maximum | 2.9874848 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 9926901 |
|---|---|
| 5-th percentile | 24840689 |
| Q1 | 66256737 |
| median | 90820608 |
| Q3 | 1.7016224 × 10⁸ |
| 95-th percentile | 2.7663064 × 10⁸ |
| Maximum | 2.9874848 × 10⁸ |
| Range | 2.8882158 × 10⁸ |
| Interquartile range (IQR) | 1.0390551 × 10⁸ |
Descriptive statistics
| Standard deviation | 76996740 |
|---|---|
| Coefficient of variation (CV) | 0.63139372 |
| Kurtosis | -0.59847726 |
| Mean | 1.2194727 × 10⁸ |
| Median Absolute Deviation (MAD) | 53228502 |
| Skewness | 0.64987466 |
| Sum | 7.2997941 × 10¹⁴ |
| Variance | 5.928498 × 10¹⁵ |
| Monotonicity | Not monotonic |
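Several ARREST_KEY figures are derived from others in the same tables. The sketch below recomputes range, IQR, coefficient of variation and variance from the displayed values, using the usual textbook definitions (an assumption about how the report derives them). Because the inputs are rounded as displayed, small differences in the last digits are expected.

```python
# Values copied from the ARREST_KEY tables of this report (rounded as displayed).
minimum = 9_926_901
maximum = 298_748_482   # exact value from the extreme-value listing
q1 = 66_256_737
q3 = 1.7016224e8
mean = 1.2194727e8
std = 76_996_740

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared

print(value_range, iqr, cv, variance)
```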
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 279197226 | 1 | < 0.1% |
| 72448178 | 1 | < 0.1% |
| 72390252 | 1 | < 0.1% |
| 72353665 | 1 | < 0.1% |
| 72371834 | 1 | < 0.1% |
| 72203716 | 1 | < 0.1% |
| 72275198 | 1 | < 0.1% |
| 72353662 | 1 | < 0.1% |
| 72165610 | 1 | < 0.1% |
| 72267859 | 1 | < 0.1% |
| Other values (5986015) | 5986015 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9926901 | 1 | |
| 9926902 | 1 | |
| 9926903 | 1 | |
| 9926904 | 1 | |
| 9926993 | 1 | |
| 9926995 | 1 | |
| 9927084 | 1 | |
| 9927085 | 1 | |
| 9927086 | 1 | |
| 9929788 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 298748482 | 1 | |
| 298725483 | 1 | |
| 298711176 | 1 | |
| 298711173 | 1 | |
| 298711171 | 1 | |
| 298711170 | 1 | |
| 298710745 | 1 | |
| 298710741 | 1 | |
| 298710736 | 1 | |
| 298710721 | 1 |
ARREST_DATE
Date
| Distinct | 6940 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.7 MiB |
| Minimum | 2006-01-01 00:00:00 |
| Maximum | 2024-12-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
PD_CD
Real number (ℝ)
| Distinct | 347 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 884 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 496.35865 |
| Minimum | 0 |
| Maximum | 997 |
| Zeros | 115 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 101 |
| Q1 | 259 |
| median | 503 |
| Q3 | 748 |
| 95-th percentile | 922 |
| Maximum | 997 |
| Range | 997 |
| Interquartile range (IQR) | 489 |
Descriptive statistics
| Standard deviation | 267.2434 |
|---|---|
| Coefficient of variation (CV) | 0.53840786 |
| Kurtosis | -1.0788363 |
| Mean | 496.35865 |
| Median Absolute Deviation (MAD) | 244 |
| Skewness | 0.02461015 |
| Sum | 2.9707765 × 10⁹ |
| Variance | 71419.033 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 101 | 513024 | 8.6% |
| 567 | 423783 | 7.1% |
| 478 | 330145 | 5.5% |
| 511 | 314492 | 5.3% |
| 339 | 305917 | 5.1% |
| 922 | 232804 | 3.9% |
| 849 | 231321 | 3.9% |
| 109 | 229080 | 3.8% |
| 397 | 201538 | 3.4% |
| 969 | 178396 | 3.0% |
| Other values (337) | 3024641 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 115 | < 0.1% |
| 1 | 36 | < 0.1% |
| 2 | 8 | < 0.1% |
| 4 | 11 | < 0.1% |
| 9 | 23 | < 0.1% |
| 11 | 10 | < 0.1% |
| 12 | 71 | < 0.1% |
| 15 | 839 | < 0.1% |
| 16 | 5963 | |
| 29 | 171 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 997 | 171 | < 0.1% |
| 973 | 247 | < 0.1% |
| 972 | 178 | < 0.1% |
| 970 | 21 | < 0.1% |
| 969 | 178396 | |
| 968 | 4890 | 0.1% |
| 967 | 4 | < 0.1% |
| 965 | 303 | < 0.1% |
| 963 | 180 | < 0.1% |
| 961 | 564 | < 0.1% |
PD_DESC
Text
| Distinct | 447 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9169 |
| Missing (%) | 0.2% |
| Memory size | 435.8 MiB |
Length
| Max length | 54 |
|---|---|
| Median length | 41 |
| Mean length | 27.407499 |
| Min length | 6 |
Unique
| Unique | 15 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | STRANGULATION 1ST |
|---|---|
| 2nd row | STRANGULATION 1ST |
| 3rd row | RAPE 3 |
| 4th row | RAPE 1 |
| 5th row | (null) |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 984469 | 5.3% |
| possession | 978149 | 5.3% |
| assault | 770334 | 4.1% |
| controlled | 611387 | 3.3% |
| 4 | 590854 | 3.2% |
| (blank) | 589565 | 3.2% |
| 5 | 519746 | 2.8% |
| marijuana | 513430 | 2.8% |
| from | 466541 | 2.5% |
| unclassified | 445571 | 2.4% |
| Other values (563) | 12154335 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 17804747 | 10.9% |
| A | 13294727 | 8.1% |
| E | 13276527 | 8.1% |
| (space) | 13143956 | 8.0% |
| I | 12781461 | 7.8% |
| N | 11710503 | 7.1% |
| O | 8660835 | 5.3% |
| T | 8175873 | 5.0% |
| L | 8040767 | 4.9% |
| R | 7694778 | 4.7% |
| Other values (35) | 49226499 |
KY_CD
Real number (ℝ)
High correlation
| Distinct | 76 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9788 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 292.87388 |
| Minimum | 101 |
| Maximum | 995 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 106 |
| Q1 | 121 |
| median | 341 |
| Q3 | 348 |
| 95-th percentile | 677 |
| Maximum | 995 |
| Range | 894 |
| Interquartile range (IQR) | 227 |
Descriptive statistics
| Standard deviation | 177.80655 |
|---|---|
| Coefficient of variation (CV) | 0.60710962 |
| Kurtosis | 3.1701389 |
| Mean | 292.87388 |
| Median Absolute Deviation (MAD) | 106 |
| Skewness | 1.5907753 |
| Sum | 1.7502837 × 10⁹ |
| Variance | 31615.169 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 235 | 827688 | 13.8% |
| 344 | 645227 | 10.8% |
| 343 | 332781 | 5.6% |
| 117 | 316371 | 5.3% |
| 341 | 306203 | 5.1% |
| 106 | 288434 | 4.8% |
| 348 | 243577 | 4.1% |
| 677 | 231742 | 3.9% |
| 126 | 222307 | 3.7% |
| 352 | 208608 | 3.5% |
| Other values (66) | 2353299 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 101 | 21030 | 0.4% |
| 102 | 148 | < 0.1% |
| 103 | 364 | < 0.1% |
| 104 | 15291 | 0.3% |
| 105 | 202169 | |
| 106 | 288434 | |
| 107 | 96064 | 1.6% |
| 109 | 164147 | |
| 110 | 23185 | 0.4% |
| 111 | 26035 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 995 | 24804 | 0.4% |
| 882 | 171 | < 0.1% |
| 881 | 183344 | |
| 880 | 8479 | 0.1% |
| 685 | 175 | < 0.1% |
| 678 | 17874 | 0.3% |
| 677 | 231742 | |
| 676 | 519 | < 0.1% |
| 675 | 14276 | 0.2% |
| 672 | 723 | < 0.1% |
OFNS_DESC
Text
| Distinct | 90 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9169 |
| Missing (%) | 0.2% |
| Memory size | 398.6 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 33 |
| Mean length | 20.875512 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FELONY ASSAULT |
|---|---|
| 2nd row | FELONY ASSAULT |
| 3rd row | RAPE |
| 4th row | RAPE |
| 5th row | (null) |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| offenses | 1450557 | 7.8% |
| dangerous | 1377233 | 7.4% |
| related | 1239365 | 6.6% |
| drugs | 1144070 | 6.1% |
| (blank) | 1131867 | 6.1% |
| assault | 933661 | 5.0% |
| other | 860155 | 4.6% |
| 3 | 656793 | 3.5% |
| laws | 579855 | 3.1% |
| larceny | 493535 | 2.6% |
| Other values (143) | 8814104 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 13024551 | 10.4% |
| (space) | 12704339 | 10.2% |
| S | 11812500 | 9.5% |
| A | 10267769 | 8.2% |
| R | 8832815 | 7.1% |
| T | 7996095 | 6.4% |
| N | 7629493 | 6.1% |
| O | 7378636 | 5.9% |
| L | 6160459 | 4.9% |
| F | 5488855 | 4.4% |
| Other values (31) | 33474415 |
LAW_CODE
Text
| Distinct | 2627 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 196 |
| Missing (%) | < 0.1% |
| Memory size | 336.8 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.999944 |
| Min length | 2 |
Unique
| Unique | 415 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PL 1211200 |
|---|---|
| 2nd row | PL 1211300 |
| 3rd row | PL 1302503 |
| 4th row | PL 1303501 |
| 5th row | PL 2407800 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| pl | 5037045 | |
| 1200001 | 486407 | 4.4% |
| 2211001 | 406464 | 3.7% |
| 1651503 | 317125 | 2.9% |
| 2200300 | 314492 | 2.8% |
| 1552500 | 305917 | 2.8% |
| loc000000v | 223434 | 2.0% |
| vtl051101a | 165522 | 1.5% |
| 1654000 | 156770 | 1.4% |
| vtl0511001 | 144075 | 1.3% |
| Other values (2622) | 3488064 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 16400705 | |
| 1 | 9267940 | |
| L | 5904738 | 9.9% |
| 2 | 5630682 | 9.4% |
| (space) | 5059486 | 8.5% |
| P | 5046918 | 8.4% |
| 5 | 4723422 | 7.9% |
| 3 | 1502388 | 2.5% |
| 6 | 1346612 | 2.2% |
| 4 | 1208946 | 2.0% |
| Other values (31) | 3766118 | 6.3% |
LAW_CAT_CD
Categorical
High correlation Imbalance
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 24990 |
| Missing (%) | 0.4% |
| Memory size | 285.6 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000084 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| M | 3866398 | 64.6% |
| F | 1767834 | 29.5% |
| V | 297792 | 5.0% |
| I | 27200 | 0.5% |
| 9 | 1801 | < 0.1% |
| (null) | 10 | < 0.1% |
| (Missing) | 24990 | 0.4% |
ARREST_BORO
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8 |
| Missing (%) | < 0.1% |
| Memory size | 285.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | K |
| 3rd row | K |
| 4th row | B |
| 5th row | Q |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| K | 1658691 | 27.7% |
| M | 1592300 | 26.6% |
| B | 1368787 | 22.9% |
| Q | 1147992 | 19.2% |
| S | 218247 | 3.6% |
| (Missing) | 8 | < 0.1% |
ARREST_PRECINCT
Real number (ℝ)
High correlation
| Distinct | 80 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 61.039387 |
| Minimum | 1 |
| Maximum | 483 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 33 |
| median | 60 |
| Q3 | 88 |
| 95-th percentile | 115 |
| Maximum | 483 |
| Range | 482 |
| Interquartile range (IQR) | 55 |
Descriptive statistics
| Standard deviation | 34.419945 |
|---|---|
| Coefficient of variation (CV) | 0.56389729 |
| Kurtosis | -1.1129989 |
| Mean | 61.039387 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.13929039 |
| Sum | 3.653833 × 10⁸ |
| Variance | 1184.7326 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 14 | 208437 | 3.5% |
| 75 | 201899 | 3.4% |
| 44 | 190208 | 3.2% |
| 40 | 177271 | 3.0% |
| 73 | 158637 | 2.7% |
| 46 | 148354 | 2.5% |
| 43 | 144668 | 2.4% |
| 52 | 140123 | 2.3% |
| 25 | 129453 | 2.2% |
| 103 | 128643 | 2.1% |
| Other values (70) | 4358332 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 72317 | 1.2% |
| 5 | 90035 | |
| 6 | 64254 | 1.1% |
| 7 | 54715 | 0.9% |
| 9 | 61588 | 1.0% |
| 10 | 50372 | 0.8% |
| 13 | 78123 | 1.3% |
| 14 | 208437 | |
| 17 | 30224 | 0.5% |
| 18 | 86402 |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 483 | 3 | < 0.1% |
| 123 | 23981 | 0.4% |
| 122 | 49903 | |
| 121 | 34453 | 0.6% |
| 120 | 109910 | |
| 116 | 111 | < 0.1% |
| 115 | 110800 | |
| 114 | 97063 | |
| 113 | 116850 | |
| 112 | 39525 | 0.7% |
JURISDICTION_CODE
Real number (ℝ)
Zeros
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2534464 |
| Minimum | 0 |
| Maximum | 97 |
| Zeros | 5028649 |
| Zeros (%) | 84.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 97 |
| Range | 97 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 9.2122302 |
|---|---|
| Coefficient of variation (CV) | 7.3495206 |
| Kurtosis | 85.396658 |
| Mean | 1.2534464 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.2161239 |
| Sum | 7503149 |
| Variance | 84.865185 |
| Monotonicity | Not monotonic |
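With 84.0% of JURISDICTION_CODE values equal to zero, the median, Q1 and Q3 are all zero, which is why both the MAD and the IQR come out as 0 even though the standard deviation is about 9.2. The sketch below illustrates this on a made-up sample with a similar share of zeros; the sample and the `median_abs_deviation` helper are hypothetical and not taken from the report.

```python
import statistics

def median_abs_deviation(values):
    """Median of the absolute deviations from the median."""
    med = statistics.median(values)
    return statistics.median([abs(v - med) for v in values])

# Hypothetical sample of 100 codes: 84 zeros plus a few non-zero codes.
sample = [0] * 84 + [1] * 8 + [2] * 5 + [97] * 3

q1, _, q3 = statistics.quantiles(sample, n=4)  # quartile cut points
iqr = q3 - q1
mad = median_abs_deviation(sample)
std = statistics.pstdev(sample)

print(iqr, mad, std)
```

Robust spread measures therefore say nothing about the non-zero tail here; the standard deviation and the maximum are the figures that reveal it.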
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5028649 | 84.0% |
| 1 | 501880 | 8.4% |
| 2 | 298656 | 5.0% |
| 3 | 50611 | 0.8% |
| 97 | 34071 | 0.6% |
| 72 | 16985 | 0.3% |
| 4 | 11610 | 0.2% |
| 73 | 7690 | 0.1% |
| 69 | 6908 | 0.1% |
| 6 | 5503 | 0.1% |
| Other values (20) | 23452 | 0.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5028649 | 84.0% |
| 1 | 501880 | 8.4% |
| 2 | 298656 | 5.0% |
| 3 | 50611 | 0.8% |
| 4 | 11610 | 0.2% |
| 6 | 5503 | 0.1% |
| 7 | 3635 | 0.1% |
| 8 | 12 | < 0.1% |
| 9 | 573 | < 0.1% |
| 11 | 4308 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 97 | 34071 | 0.6% |
| 88 | 281 | < 0.1% |
| 87 | 1022 | < 0.1% |
| 85 | 650 | < 0.1% |
| 82 | 2 | < 0.1% |
| 79 | 212 | < 0.1% |
| 76 | 52 | < 0.1% |
| 74 | 130 | < 0.1% |
| 73 | 7690 | 0.1% |
| 72 | 16985 |
AGE_GROUP
Text
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17 |
| Missing (%) | < 0.1% |
| Memory size | 307.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 4.8312054 |
| Min length | 3 |
Unique
| Unique | 50 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 25-44 |
|---|---|
| 2nd row | 25-44 |
| 3rd row | 25-44 |
| 4th row | 45-64 |
| 5th row | <18 |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 25-44 | 2874418 | 48.0% |
| 18-24 | 1491336 | 24.9% |
| 45-64 | 1115032 | 18.6% |
| 18 | 447699 | 7.5% |
| 65 | 57345 | 1.0% |
| 895 | 13 | < 0.1% |
| 945 | 7 | < 0.1% |
| 894 | 7 | < 0.1% |
| 935 | 7 | < 0.1% |
| 928 | 5 | < 0.1% |
| Other values (81) | 139 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 9470294 | |
| - | 5480786 | |
| 2 | 4365818 | |
| 5 | 4046839 | |
| 8 | 1939082 | 6.7% |
| 1 | 1939078 | 6.7% |
| 6 | 1172394 | 4.1% |
| < | 447699 | 1.5% |
| + | 57345 | 0.2% |
| 9 | 156 | < 0.1% |
| Other values (8) | 143 | < 0.1% |
PERP_SEX
Categorical
Imbalance
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 285.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| M | 4971527 | 83.1% |
| F | 1010994 | 16.9% |
| U | 3504 | 0.1% |
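The 58.2% imbalance figure reported for PERP_SEX can be reproduced from the counts above if the score is taken to be one minus the normalized Shannon entropy of the class shares. That formula is an assumption here (it happens to match the reported value), and `imbalance_score` is an illustrative helper, not a function of the profiling library.

```python
import math

def imbalance_score(counts):
    """1 - H / log(k): 0 for a uniform split, approaching 1 for a single dominant class."""
    total = sum(counts)
    k = len(counts)  # number of classes; k must be at least 2
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)

# Counts copied from the PERP_SEX common-values table.
perp_sex = {"M": 4_971_527, "F": 1_010_994, "U": 3_504}
score = imbalance_score(list(perp_sex.values()))

print(f"{score:.1%}")
```

A perfectly balanced three-class variable would score 0 under this definition, so the value for PERP_SEX reflects how strongly the M class dominates.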
PERP_RACE
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 331.0 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 8.9744737 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WHITE |
|---|---|
| 2nd row | BLACK |
| 3rd row | BLACK |
| 4th row | BLACK |
| 5th row | WHITE HISPANIC |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| BLACK | 2904570 | 48.5% |
| WHITE HISPANIC | 1551099 | 25.9% |
| WHITE | 705233 | 11.8% |
| BLACK HISPANIC | 495249 | 8.3% |
| ASIAN / PACIFIC ISLANDER | 258657 | 4.3% |
| UNKNOWN | 55942 | 0.9% |
| AMERICAN INDIAN/ALASKAN NATIVE | 13912 | 0.2% |
| OTHER | 1363 | < 0.1% |
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| black | 3399819 | 38.5% |
| white | 2256332 | 25.5% |
| hispanic | 2046348 | 23.2% |
| asian | 258657 | 2.9% |
| (blank) | 258657 | 2.9% |
| pacific | 258657 | 2.9% |
| islander | 258657 | 2.9% |
| unknown | 55942 | 0.6% |
| american | 13912 | 0.2% |
| indian/alaskan | 13912 | 0.2% |
| Other values (2) | 15275 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 7439304 | |
| A | 6578179 | |
| C | 5977393 | |
| H | 4304043 | |
| L | 3672388 | 6.8% |
| K | 3469673 | 6.5% |
| B | 3399819 | 6.3% |
| (space) | 2850143 | 5.3% |
| N | 2801048 | 5.2% |
| S | 2577574 | 4.8% |
| Other values (12) | 10651860 |
X_COORD_CD
Real number (ℝ)
High correlation
| Distinct | 73438 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005375.2 |
| Minimum | 0 |
| Maximum | 1067302 |
| Zeros | 11 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 981343 |
| Q1 | 992932 |
| median | 1004937 |
| Q3 | 1016155 |
| 95-th percentile | 1041879 |
| Maximum | 1067302 |
| Range | 1067302 |
| Interquartile range (IQR) | 23223 |
Descriptive statistics
| Standard deviation | 20268.751 |
|---|---|
| Coefficient of variation (CV) | 0.020160386 |
| Kurtosis | 12.74163 |
| Mean | 1005375.2 |
| Median Absolute Deviation (MAD) | 11564 |
| Skewness | -0.42544777 |
| Sum | 6.0181998 × 10¹² |
| Variance | 4.1082229 × 10⁸ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1017119 | 24586 | 0.4% |
| 987220 | 24499 | 0.4% |
| 1006537 | 19960 | 0.3% |
| 1026486 | 18173 | 0.3% |
| 1020183 | 17918 | 0.3% |
| 962822 | 17829 | 0.3% |
| 997897 | 17616 | 0.3% |
| 987078 | 16124 | 0.3% |
| 1005041 | 16053 | 0.3% |
| 1007694 | 15826 | 0.3% |
| Other values (73428) | 5797440 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11 | |
| 913357 | 3 | < 0.1% |
| 913411 | 1 | < 0.1% |
| 913463 | 10 | |
| 913512 | 8 | |
| 913554 | 2 | < 0.1% |
| 913626 | 1 | < 0.1% |
| 913682 | 1 | < 0.1% |
| 913818 | 3 | < 0.1% |
| 913844 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1067302 | 1 | < 0.1% |
| 1067298 | 1 | < 0.1% |
| 1067249 | 5 | < 0.1% |
| 1067226 | 10 | < 0.1% |
| 1067220 | 2 | < 0.1% |
| 1067185 | 25 | |
| 1067151 | 1 | < 0.1% |
| 1067117 | 4 | < 0.1% |
| 1067113 | 6 | < 0.1% |
| 1067053 | 4 | < 0.1% |
Y_COORD_CD
Real number (ℝ)
High correlation Skewed
| Distinct | 77889 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 213866.8 |
| Minimum | 0 |
| Maximum | 8202360 |
| Zeros | 11 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 159815 |
| Q1 | 186782 |
| median | 208981 |
| Q3 | 236608 |
| 95-th percentile | 254156 |
| Maximum | 8202360 |
| Range | 8202360 |
| Interquartile range (IQR) | 49826 |
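The gap between the 95-th percentile (254156) and the maximum (8202360) suggests a small number of extreme Y_COORD_CD values. One common way to flag such values is Tukey's 1.5 × IQR fences; this rule is brought in here purely as an illustration and is not something the report itself applies. The sketch uses only the quartiles listed above.

```python
# Quartiles copied from the Y_COORD_CD quantile table.
q1, q3 = 186_782, 236_608
iqr = q3 - q1

# Tukey fences (illustrative rule, not part of the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

reported_min, reported_max = 0, 8_202_360
print(lower_fence, upper_fence)
print(reported_min < lower_fence, reported_max > upper_fence)
```

Under this rule both the zero coordinates and the multi-million values fall outside the fences, which fits the "Skewed" alert raised for this variable.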
Descriptive statistics
| Standard deviation | 151374.39 |
|---|---|
| Coefficient of variation (CV) | 0.70779751 |
| Kurtosis | 1523.2051 |
| Mean | 213866.8 |
| Median Absolute Deviation (MAD) | 24819 |
| Skewness | 36.551557 |
| Sum | 1.2802118 × 10¹² |
| Variance | 2.2914205 × 10¹⁰ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 183909 | 24547 | 0.4% |
| 212676 | 24537 | 0.4% |
| 234533 | 20176 | 0.3% |
| 244511 | 19927 | 0.3% |
| 262591 | 18096 | 0.3% |
| 174282 | 17821 | 0.3% |
| 215157 | 16124 | 0.3% |
| 183789 | 15583 | 0.3% |
| 216954 | 15281 | 0.3% |
| 239283 | 15111 | 0.3% |
| Other values (77879) | 5798821 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11 | |
| 121131 | 3 | < 0.1% |
| 121152 | 5 | |
| 121174 | 2 | < 0.1% |
| 121219 | 3 | < 0.1% |
| 121250 | 5 | |
| 121282 | 2 | < 0.1% |
| 121312 | 3 | < 0.1% |
| 121343 | 3 | < 0.1% |
| 121390 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8202360 | 155 | < 0.1% |
| 8187668 | 157 | < 0.1% |
| 7250292 | 144 | < 0.1% |
| 7236187 | 32 | < 0.1% |
| 7220451 | 225 | |
| 7209909 | 295 | |
| 7192044 | 2 | < 0.1% |
| 7186840 | 167 | < 0.1% |
| 6253476 | 107 | < 0.1% |
| 6216843 | 432 |
Latitude
Real number (ℝ)
High correlation Skewed
| Distinct | 186539 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.753426 |
| Minimum | 0 |
| Maximum | 62.083075 |
| Zeros | 11 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.605247 |
| Q1 | 40.679284 |
| median | 40.740236 |
| Q3 | 40.816088 |
| 95-th percentile | 40.864235 |
| Maximum | 62.083075 |
| Range | 62.083075 |
| Interquartile range (IQR) | 0.13680326 |
Descriptive statistics
| Standard deviation | 0.41295175 |
|---|---|
| Coefficient of variation (CV) | 0.010132934 |
| Kurtosis | 1615.1641 |
| Mean | 40.753426 |
| Median Absolute Deviation (MAD) | 0.068142762 |
| Skewness | 33.478516 |
| Sum | 2.4395082 × 10⁸ |
| Variance | 0.17052915 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 40.75043077 | 19911 | 0.3% |
| 40.67141166 | 19850 | 0.3% |
| 40.64502275 | 17821 | 0.3% |
| 40.81039849 | 16047 | 0.3% |
| 40.83778162 | 15797 | 0.3% |
| 40.75724053 | 14915 | 0.2% |
| 40.68004873 | 14517 | 0.2% |
| 40.64886713 | 14429 | 0.2% |
| 40.88733282 | 14334 | 0.2% |
| 40.82338729 | 14304 | 0.2% |
| Other values (186529) | 5824095 | 97.3% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11 | < 0.1% |
| 40.49890536 | 3 | < 0.1% |
| 40.49895701 | 5 | < 0.1% |
| 40.49902536 | 2 | < 0.1% |
| 40.49914279 | 3 | < 0.1% |
| 40.49922879 | 5 | < 0.1% |
| 40.4993236 | 2 | < 0.1% |
| 40.499393 | 1 | < 0.1% |
| 40.49940083 | 2 | < 0.1% |
| 40.49948686 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 62.08307498 | 155 | < 0.1% |
| 62.0459844 | 157 | < 0.1% |
| 59.65727395 | 144 | < 0.1% |
| 59.62096122 | 32 | < 0.1% |
| 59.58050882 | 225 | < 0.1% |
| 59.55331498 | 295 | < 0.1% |
| 59.50737263 | 2 | < 0.1% |
| 59.49372016 | 167 | < 0.1% |
| 57.07018725 | 107 | < 0.1% |
| 56.97414271 | 432 | < 0.1% |
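The extreme values for Latitude (0 for 11 rows, and a maximum of 62.083075) lie far from the 5-th to 95-th percentile range of 40.605247 to 40.864235, which suggests sentinel or mis-projected coordinates. A minimal sketch of a bounding-box check follows. The bounds `NYC_LAT` and `NYC_LON` are illustrative assumptions chosen to bracket the bulk of the values reported here, not official limits, and `in_nyc_bbox` is a hypothetical helper.

```python
from typing import Optional

# Assumed, illustrative bounds that loosely bracket the bulk of the data.
NYC_LAT = (40.4, 41.0)
NYC_LON = (-74.3, -73.6)


def in_nyc_bbox(lat: Optional[float], lon: Optional[float]) -> bool:
    """Return True when both coordinates are present and inside the assumed box."""
    if lat is None or lon is None:
        return False
    return NYC_LAT[0] <= lat <= NYC_LAT[1] and NYC_LON[0] <= lon <= NYC_LON[1]


rows = [
    (40.765390, -73.985702),  # first sample row of this report
    (0.0, 0.0),               # zero sentinel (11 rows in each coordinate column)
    (62.08307498, -73.9),     # maximum latitude; the longitude is a placeholder
]
flags = [in_nyc_bbox(lat, lon) for lat, lon in rows]
```

Rows that fail the check could be set aside for review rather than dropped outright, since the underlying X_COORD_CD and Y_COORD_CD values may still be recoverable.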
Longitude
Real number (ℝ)
High correlation Skewed
| Distinct | 188004 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.923571 |
| Minimum | -74.254939 |
|---|---|
| Maximum | 0 |
| Zeros | 11 |
| Zeros (%) | < 0.1% |
| Negative | 5986009 |
| Negative (%) | > 99.9% |
| Memory size | 45.7 MiB |
Quantile statistics
| Minimum | -74.254939 |
|---|---|
| 5-th percentile | -74.010474 |
| Q1 | -73.968669 |
| median | -73.925311 |
| Q3 | -73.884691 |
| 95-th percentile | -73.792139 |
| Maximum | 0 |
| Range | 74.254939 |
| Interquartile range (IQR) | 0.083977468 |
Descriptive statistics
| Standard deviation | 0.12396636 |
|---|---|
| Coefficient of variation (CV) | -0.001676953 |
| Kurtosis | 232362.93 |
| Mean | -73.923571 |
| Median Absolute Deviation (MAD) | 0.041741562 |
| Skewness | 389.6237 |
| Sum | -4.4250798 × 10⁸ |
| Variance | 0.015367658 |
| Monotonicity | Not monotonic |
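The skewness of 389.6237 is likely dominated by a few values far from the bulk: the maximum is 0, while the 95-th percentile is -73.792139. The sketch below illustrates the effect on made-up numbers. It uses the population (biased) moment estimator, which is an assumption and may differ from the estimator the profiling tool applies; `skewness`, `clean`, and `with_zero` are hypothetical names.

```python
from statistics import fmean


def skewness(values):
    """Population skewness: third central moment divided by variance ** 1.5."""
    mean = fmean(values)
    m2 = fmean((v - mean) ** 2 for v in values)
    m3 = fmean((v - mean) ** 3 for v in values)
    return m3 / m2 ** 1.5


# Synthetic, symmetric longitudes around -73.93 (not taken from the dataset).
clean = [-73.97, -73.95, -73.93, -73.91, -73.89]
# The same values repeated, plus a single zero sentinel.
with_zero = clean * 20 + [0.0]

skew_clean = skewness(clean)
skew_with_zero = skewness(with_zero)
```

On this toy input the symmetric sample has skewness close to zero, and one zero among 101 values is enough to push it strongly positive.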
| Value | Count | Frequency (%) |
|---|---|---|
| -73.98928218 | 19911 | 0.3% |
| -73.88151172 | 19850 | 0.3% |
| -73.91945797 | 16955 | 0.3% |
| -74.07721685 | 16618 | 0.3% |
| -73.98979364 | 14915 | 0.2% |
| -73.87999831 | 14828 | 0.2% |
| -73.92489531 | 14694 | 0.2% |
| -73.77590919 | 14517 | 0.2% |
| -73.9508219 | 14429 | 0.2% |
| -73.84725001 | 14334 | 0.2% |
| Other values (187994) | 5824969 | 97.3% |
| Value | Count | Frequency (%) |
|---|---|---|
| -74.25493874 | 3 | < 0.1% |
| -74.25474319 | 1 | < 0.1% |
| -74.25455981 | 10 | < 0.1% |
| -74.254377 | 8 | < 0.1% |
| -74.25422295 | 2 | < 0.1% |
| -74.25395113 | 1 | < 0.1% |
| -74.25376703 | 1 | < 0.1% |
| -74.253256 | 3 | < 0.1% |
| -74.25318724 | 3 | < 0.1% |
| -74.253187 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11 | < 0.1% |
| -73.68178027 | 169 | < 0.1% |
| -73.68478838 | 167 | < 0.1% |
| -73.70029335 | 1 | < 0.1% |
| -73.70031586 | 1 | < 0.1% |
| -73.70049339 | 5 | < 0.1% |
| -73.70056786 | 4 | < 0.1% |
| -73.70057651 | 6 | < 0.1% |
| -73.70059685 | 2 | < 0.1% |
| -73.700717 | 6 | < 0.1% |
Lon_Lat
Text
| Distinct | 198169 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 523.1 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 44 |
| Mean length | 42.630321 |
| Min length | 11 |
Unique
| Unique | 47756 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | POINT (-73.985702 40.76539) |
|---|---|
| 2nd row | POINT (-73.95082 40.648859) |
| 3rd row | POINT (-73.9305713255961 40.6744956865259) |
| 4th row | POINT (-73.9005768807295 40.8535983673823) |
| 5th row | POINT (-73.901881 40.699373) |
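The sample rows follow the WKT form `POINT (longitude latitude)`, with longitude first. A minimal sketch of parsing such strings is shown here. `parse_point` and `_POINT_RE` are hypothetical names, and the pattern assumes plain decimal notation with a single space between the two numbers, as in the samples above.

```python
import re
from typing import Optional, Tuple

# Matches strings such as "POINT (-73.985702 40.76539)".
_POINT_RE = re.compile(r"^POINT \((-?\d+(?:\.\d+)?) (-?\d+(?:\.\d+)?)\)$")


def parse_point(text: str) -> Optional[Tuple[float, float]]:
    """Parse a WKT POINT string into (longitude, latitude), or None on mismatch."""
    match = _POINT_RE.match(text.strip())
    if match is None:
        return None
    return float(match.group(1)), float(match.group(2))


first = parse_point("POINT (-73.985702 40.76539)")  # 1st sample row
zero = parse_point("POINT (0 0)")                   # an 11-character string
bad = parse_point("n/a")                            # not a POINT string
```

The reported minimum length of 11 matches the length of `POINT (0 0)`, which would be consistent with the 11 zero-valued coordinates, though the report does not show those rows directly.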
| Value | Count | Frequency (%) |
|---|---|---|
| point | 5986020 | 33.3% |
| 73.98928217599996 | 20875 | 0.1% |
| 40.75043076800005 | 19911 | 0.1% |
| 40.67141166300007 | 19850 | 0.1% |
| 73.88151172399995 | 19850 | 0.1% |
| 73.91945797099999 | 16955 | 0.1% |
| 74.077216847 | 16618 | 0.1% |
| 40.645022746000045 | 16618 | 0.1% |
| 73.92489531099994 | 16047 | 0.1% |
| 40.810398494000026 | 16047 | 0.1% |
| Other values (374460) | 11809269 | 65.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 37180848 | 14.6% |
| 9 | 33532857 | 13.1% |
| 7 | 19781514 | 7.8% |
| 4 | 18361995 | 7.2% |
| 3 | 16356123 | 6.4% |
| 8 | 14300140 | 5.6% |
| 6 | 12923052 | 5.1% |
| (space) | 11972040 | 4.7% |
| . | 11972018 | 4.7% |
| 5 | 11890639 | 4.7% |
| Other values (10) | 66914728 | 26.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 255185954 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 255185954 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 255185954 | 100.0% |
Interactions
Correlations
| | ARREST_BORO | ARREST_KEY | ARREST_PRECINCT | JURISDICTION_CODE | KY_CD | LAW_CAT_CD | Latitude | Longitude | PD_CD | PERP_RACE | PERP_SEX | X_COORD_CD | Y_COORD_CD |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARREST_BORO | 1.000 | 0.022 | 0.787 | 0.032 | 0.072 | 0.066 | 0.021 | 0.001 | 0.098 | 0.170 | 0.015 | 0.573 | 0.023 |
| ARREST_KEY | 0.022 | 1.000 | 0.025 | -0.041 | -0.099 | 0.090 | -0.030 | -0.001 | -0.137 | 0.026 | 0.060 | -0.001 | -0.030 |
| ARREST_PRECINCT | 0.787 | 0.025 | 1.000 | -0.116 | -0.020 | 0.047 | -0.475 | 0.406 | 0.013 | 0.142 | 0.005 | 0.407 | -0.475 |
| JURISDICTION_CODE | 0.032 | -0.041 | -0.116 | 1.000 | 0.149 | 0.019 | 0.036 | -0.056 | 0.037 | 0.021 | 0.013 | -0.056 | 0.036 |
| KY_CD | 0.072 | -0.099 | -0.020 | 0.149 | 1.000 | 0.724 | 0.013 | -0.006 | 0.289 | 0.030 | 0.063 | -0.006 | 0.013 |
| LAW_CAT_CD | 0.066 | 0.090 | 0.047 | 0.019 | 0.724 | 1.000 | 0.013 | 0.021 | 0.407 | 0.027 | 0.050 | 0.028 | 0.007 |
| Latitude | 0.021 | -0.030 | -0.475 | 0.036 | 0.013 | 0.013 | 1.000 | 0.266 | -0.031 | 0.005 | 0.004 | 0.264 | 1.000 |
| Longitude | 0.001 | -0.001 | 0.406 | -0.056 | -0.006 | 0.021 | 0.266 | 1.000 | -0.001 | 0.002 | 0.000 | 1.000 | 0.266 |
| PD_CD | 0.098 | -0.137 | 0.013 | 0.037 | 0.289 | 0.407 | -0.031 | -0.001 | 1.000 | 0.041 | 0.100 | -0.001 | -0.031 |
| PERP_RACE | 0.170 | 0.026 | 0.142 | 0.021 | 0.030 | 0.027 | 0.005 | 0.002 | 0.041 | 1.000 | 0.039 | 0.111 | 0.005 |
| PERP_SEX | 0.015 | 0.060 | 0.005 | 0.013 | 0.063 | 0.050 | 0.004 | 0.000 | 0.100 | 0.039 | 1.000 | 0.013 | 0.004 |
| X_COORD_CD | 0.573 | -0.001 | 0.407 | -0.056 | -0.006 | 0.028 | 0.264 | 1.000 | -0.001 | 0.111 | 0.013 | 1.000 | 0.265 |
| Y_COORD_CD | 0.023 | -0.030 | -0.475 | 0.036 | 0.013 | 0.007 | 1.000 | 0.266 | -0.031 | 0.005 | 0.004 | 0.265 | 1.000 |
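The matrix shows several near-duplicate columns, for example Latitude with Y_COORD_CD and Longitude with X_COORD_CD at 1.000. The following sketch lists pairs above a cutoff using a hand-copied subset of the coefficients above. The `THRESHOLD` of 0.5 is an assumed cutoff for illustration, and `high_pairs` is a hypothetical helper, not part of the profiling tool.

```python
# A subset of the matrix above, keyed by unordered column pair.
corr = {
    ("Latitude", "Y_COORD_CD"): 1.000,
    ("Longitude", "X_COORD_CD"): 1.000,
    ("ARREST_BORO", "ARREST_PRECINCT"): 0.787,
    ("KY_CD", "LAW_CAT_CD"): 0.724,
    ("ARREST_BORO", "X_COORD_CD"): 0.573,
    ("ARREST_PRECINCT", "Latitude"): -0.475,
    ("Latitude", "Longitude"): 0.266,
}

THRESHOLD = 0.5  # assumed cutoff for illustration


def high_pairs(matrix, threshold):
    """Return pairs whose absolute coefficient meets the threshold, strongest first."""
    hits = [(pair, value) for pair, value in matrix.items() if abs(value) >= threshold]
    return sorted(hits, key=lambda item: -abs(item[1]))


flagged = high_pairs(corr, THRESHOLD)
flagged_pairs = [pair for pair, _ in flagged]
```

With this cutoff the five flagged pairs match the "High correlation" alerts listed in the overview; one column from each coordinate pair could be treated as redundant in downstream modeling.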
Missing values
Sample
| | ARREST_KEY | ARREST_DATE | PD_CD | PD_DESC | KY_CD | OFNS_DESC | LAW_CODE | LAW_CAT_CD | ARREST_BORO | ARREST_PRECINCT | JURISDICTION_CODE | AGE_GROUP | PERP_SEX | PERP_RACE | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | Lon_Lat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 279197226 | 12/19/2023 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211200 | F | M | 18 | 0.0 | 25-44 | M | WHITE | 988210.0 | 218129.0 | 40.765390 | -73.985702 | POINT (-73.985702 40.76539) |
| 1 | 278761840 | 12/09/2023 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211300 | F | K | 67 | 0.0 | 25-44 | M | BLACK | 997897.0 | 175676.0 | 40.648859 | -73.950820 | POINT (-73.95082 40.648859) |
| 2 | 278506761 | 12/05/2023 | 153.0 | RAPE 3 | 104.0 | RAPE | PL 1302503 | F | K | 77 | 0.0 | 25-44 | M | BLACK | 1003509.0 | 185018.0 | 40.674496 | -73.930571 | POINT (-73.9305713255961 40.6744956865259) |
| 3 | 278436408 | 12/03/2023 | 157.0 | RAPE 1 | 104.0 | RAPE | PL 1303501 | F | B | 46 | 0.0 | 45-64 | M | BLACK | 1011755.0 | 250279.0 | 40.853598 | -73.900577 | POINT (-73.9005768807295 40.8535983673823) |
| 4 | 278248753 | 11/29/2023 | 660.0 | (null) | NaN | (null) | PL 2407800 | M | Q | 104 | 0.0 | <18 | M | WHITE HISPANIC | 1011456.0 | 194092.0 | 40.699373 | -73.901881 | POINT (-73.901881 40.699373) |
| 5 | 278254593 | 11/29/2023 | 464.0 | JOSTLING | 230.0 | JOSTLING | PL 1652501 | M | M | 18 | 0.0 | <18 | M | WHITE HISPANIC | 990503.0 | 215519.0 | 40.758225 | -73.977428 | POINT (-73.977428 40.758225) |
| 6 | 277850807 | 11/21/2023 | 263.0 | ARSON 2,3,4 | 114.0 | ARSON | PL 1501001 | F | K | 63 | 71.0 | 25-44 | M | WHITE | 1000734.0 | 164367.0 | 40.617813 | -73.940621 | POINT (-73.940621 40.617813) |
| 7 | 276523582 | 10/26/2023 | 177.0 | SEXUAL ABUSE | 116.0 | SEX CRIMES | PL 2603204 | F | M | 28 | 0.0 | 25-44 | M | BLACK | 997407.0 | 233806.0 | 40.808418 | -73.952474 | POINT (-73.9524740603515 40.8084177460021) |
| 8 | 276466505 | 10/25/2023 | 157.0 | RAPE 1 | 104.0 | RAPE | PL 1303501 | F | K | 77 | 0.0 | 25-44 | M | BLACK | 1003509.0 | 185018.0 | 40.674496 | -73.930571 | POINT (-73.9305713255961 40.6744956865259) |
| 9 | 276391494 | 10/24/2023 | 168.0 | SODOMY 1 | 116.0 | SEX CRIMES | PL 1305004 | F | K | 77 | 0.0 | 45-64 | M | WHITE | 1003509.0 | 185018.0 | 40.674496 | -73.930571 | POINT (-73.9305713255961 40.6744956865259) |
| | ARREST_KEY | ARREST_DATE | PD_CD | PD_DESC | KY_CD | OFNS_DESC | LAW_CODE | LAW_CAT_CD | ARREST_BORO | ARREST_PRECINCT | JURISDICTION_CODE | AGE_GROUP | PERP_SEX | PERP_RACE | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | Lon_Lat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5986015 | 297611126 | 12/06/2024 | 244.0 | BURGLARY,UNCLASSIFIED,UNKNOWN | 107.0 | BURGLARY | PL 1402501 | F | M | 13 | 0.0 | 45-64 | F | BLACK | 985689.0 | 208933.0 | 40.740159 | -73.994807 | POINT (-73.994807 40.740159) |
| 5986016 | 297536476 | 12/05/2024 | 478.0 | THEFT OF SERVICES, UNCLASSIFIE | 343.0 | OTHER OFFENSES RELATED TO THEFT | PL 1651503 | M | B | 52 | 1.0 | 25-44 | M | BLACK | 1013463.0 | 254828.0 | 40.866070 | -73.894382 | POINT (-73.89438159169373 40.866070221647036) |
| 5986017 | 296442940 | 11/13/2024 | 759.0 | PUBLIC ADMINISTATION,UNCLASS M | 359.0 | OFFENSES AGAINST PUBLIC ADMINI | PL 1950500 | M | Q | 101 | 0.0 | 25-44 | M | WHITE | 1051297.0 | 160407.0 | 40.606704 | -73.758533 | POINT (-73.758533 40.606704) |
| 5986018 | 297266769 | 11/30/2024 | 439.0 | LARCENY,GRAND FROM OPEN AREAS, UNATTENDED | 109.0 | GRAND LARCENY | PL 1553501 | F | B | 43 | 0.0 | 18-24 | M | WHITE HISPANIC | 1021611.0 | 245695.0 | 40.840972 | -73.864972 | POINT (-73.864972 40.840972) |
| 5986019 | 298661791 | 12/30/2024 | 113.0 | MENACING,UNCLASSIFIED | 344.0 | ASSAULT 3 & RELATED OFFENSES | PL 1201401 | M | B | 43 | 0.0 | 25-44 | M | WHITE HISPANIC | 1017513.0 | 240674.0 | 40.827216 | -73.879808 | POINT (-73.879808 40.827216) |
| 5986020 | 297470037 | 12/04/2024 | 478.0 | THEFT OF SERVICES, UNCLASSIFIE | 343.0 | OTHER OFFENSES RELATED TO THEFT | PL 1651503 | M | B | 46 | 1.0 | 25-44 | M | WHITE HISPANIC | 1010256.0 | 248770.0 | 40.849453 | -73.906000 | POINT (-73.90599986315169 40.84945286686186) |
| 5986021 | 298196424 | 12/19/2024 | 244.0 | BURGLARY,UNCLASSIFIED,UNKNOWN | 107.0 | BURGLARY | PL 1402000 | F | M | 6 | 0.0 | 18-24 | M | BLACK | 983555.0 | 204888.0 | 40.729056 | -74.002507 | POINT (-74.002507 40.729056) |
| 5986022 | 298499906 | 12/26/2024 | 101.0 | ASSAULT 3 | 344.0 | ASSAULT 3 & RELATED OFFENSES | PL 1200001 | M | K | 83 | 0.0 | 45-64 | F | BLACK | 1007005.0 | 195927.0 | 40.704433 | -73.917928 | POINT (-73.917928 40.704433) |
| 5986023 | 297495137 | 12/04/2024 | 101.0 | ASSAULT 3 | 344.0 | ASSAULT 3 & RELATED OFFENSES | PL 1200001 | M | M | 14 | 0.0 | 25-44 | M | ASIAN / PACIFIC ISLANDER | 986732.0 | 211747.0 | 40.747873 | -73.991040 | POINT (-73.99104 40.747873) |
| 5986024 | 298540265 | 12/27/2024 | 639.0 | AGGRAVATED HARASSMENT 2 | 361.0 | OFF. AGNST PUB ORD SENSBLTY & | PL 2403001 | M | K | 66 | 0.0 | 25-44 | F | WHITE | 986735.0 | 167242.0 | 40.625726 | -73.991049 | POINT (-73.991049 40.625726) |